SEQ ID NO: 19 



RESULT 7 

G28289/C 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
JOURNAL 

COMMENT 



G28289 383 bp DNA STS 29-JUN-1996 

human STS SHGC-34439, sequence tagged site. 

G28289 

G28289.1 GI:1397008 

STS; STS sequence; primer; sequence tagged site, 
human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 383) 
Myers, R.M . 
Unpublished (1996) 

Contact: Richard M. Myers 

Stanford Human Genome Center (SHGC) 

Stanford University School of Medicine 

Department of Genetics, M-344, Stanford, CA 94305, USA 
Tel: 4157259687 
Fax: 4157259689 

Email : myers@shgc . Stanford . edu 



Primer A: CCACTGGCTAGAAGCCCTG 
Primer B: TACCCACTGACTCTTCCAATCA 
STS size: 126 
PCR Profile: 

Initial incubation: 94 degrees C for 90 seconds 



Protocol : 



Denaturation: 
Annealing: 
Polymerization : 
PCR Cycles: 
Thermal Cycler: 

L: 

Template : 
Primer : 
dNTPs : 

Taq Polymerase: 
Total Vol: 



94 degrees C for 15 
62 degrees C for 23 
72 degrees C for 30 
30 

Perkin Elmer 9600 

25 ng 
each 1 uM 
each 200 uM 
0.05 units/ul 
10 ul 



seconds 
seconds 
seconds 



Buffer: 



MgC12 : 
KC1: 

Tris-HCl: 
P H: 



2.5 mM 
50 mM 
20 mM 
8.3 



Prepared with primer pairs provided by Sandoz, derived from H41147 
-- Washington University/Merck EST sequence. 
FEATURES Location/Qualifiers 
source 1. .383 

/organism="Homo sapiens" 
/db_xr e fs»t axon : 9 6 0 6 " 
/map="12" 
STS 21. .146 

primer_bind 21. .39 

primer_bind complement (125 . .146) 
BASE COUNT 101 a 73 c 95 g 107 t 7 others 

ORIGIN 



Query Match 27.3%; Score 313.2; DB 54; Length 383; 

Best Local Similarity 94.9%; Pred. No. 1.3e-82; 

Matches 352; Conservative 0; Mismatches 15; Indels 4; Gaps 
Qy 251 ttacactggatgtg-ctagagactgactgcc--atgtgctcagaaagaaggcatggcaag 307 

i minium iimiiiiiiinii miiiiiiiimiim n 

Db 371 TAACACTGGATGTGCCTAGAGACTGACTGCCCATGTGCCTCAGAAAGAAGGCATGGCNAG 312 



Ov 


308 


Db 


311 


Qy 


368 


Db 


251 


Qy 


428 


Db 


191 


Qy 


488 


Db 


131 


Qy 


548 


Db 


71 


Qy 


607 


Db 


11 



actgtggaatgaggatattttttgaatcagtttatggtcaatgcaaagcaatattttata 367 

llllllllllllllllllll IIIIIIIIIIIIIIIIIIMIMIIIIIIMI IIIMI 

ACTGTGGAATGAGGATATTTNTTGAATCAGTTTATGGTCAATGCAAAGCAATACTTTATA 2 52 

tgaacaacccaagtagagttctctatttagctgcttataactgtactcttcgcccagttt 42 7 

I I I I IMMIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIMIIIIIIIIII 
TGAANAACCCAAGTAGAGTTCTCTATTTAGCTGCTTATAACTGTACTCTTCGCCCAGTTT 192 

caaaaaaaaagatttacatgacgtgccctgactgcccaagctccatacccactgactctt 4 87 

1 1 1 1 1 1 MM 1 1 1 1 1 1 1 1 1 1 1 1 M I M 1 1 1 1 1 M M 1 1 1 1 1 1 1 !M<; M 1 1 M I M 1 1 1 

CAAAAAAAAAGATTTACATGACGTGCCCTGACTGCCNAAGCTCCATACCCACTGACTCTT 132 

ccaatcaccaagtgctggaggctgccaccgagtctcttgcgaaatacaacaatgagaaca 54 7 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

C C AATC AC C AAGTGCTGG AGG CTG C C ACCG AGT CT CTTGCGAAAT AC AAC AATG AG AAC A 7 2 



1 1 1 1 1 ! 1 " i 1 1 1 1 1 1 1 i I M minim: 
ltccaagcagtattctctcttcaaagtcacccagggcttc: 

:ttcttactt 617 

IIIMIMI 



RESULT 3 
N39010 

LOCUS N39010 542 bp mRNA EST 19-JAN-1996 

DEFINITION yv22a08.sl Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 

IMAGE: 243446 3', mRNA sequence. 
ACCESSION N39010 
VERSION N39010.1 GI : 1162217 

KEYWORDS EST . 
SOURCE human . 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 542) 

AUTHORS Hillier,L., Clark, N., Dubuque, T., Elliston,K., Hawkins, M., Holman 

, M., Hultman,M., Kucaba,T., Le,M., Lennon,G., Marra,M., Parsons, J., 

Rifkin,L., Rohlfing,T., Soares, M., Tan,F., Trevaskis , E . , Waterston 

, R., Williamson, A. , Wohldmann,P. and Wilson, R. 
TITLE The Washu-Merck EST Project 
JOURNAL Unpublished (1995) 
COMMENT Contact: Wilson RK 

Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

Tel: 314 286 1800 

Fax: 314 286 1810 

Email: est@watson.wustl.edu 

High quality sequence stops: 476 

Source: IMAGE Consortium, LLNL 

This clone is available royalty- free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
Seq primer: ml3 -40 forward 
High quality sequence stop: 476. 
FEATURES Location/Qualifiers 
source 1 . . 542 

/organism= "Homo sapiens" 

/db_xref="GDB: 3 792 5 79" 

/ db_xr e f = " t axon : 9 6 0 6 " 

/clone=" IMAGE: 243446" 

/clone_lib=" Soares fetal liver spleen 1NFLS" 
/sex="male" 

/dev_stage="20 week-post conception fetus" 
/lab_host="DH10B {ampicillin resistant)" 

/not e= "Organ: Liver and Spleen; Vector: pT7T3D (Pharmacia) 
with a modified polylinker; Site_l: Pac I; Site_2: Eco RI ; 



BASE COUNT 
ORIGIN 



1st strand cDNA was primed with a Pac I - oligo(dT) primer 

double- stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia) , digested with Pac I and cloned into the Pac I 
and Eco RI sites of the modified pT7T3 vector. Library- 
went through one round of normalization. Library 
constructed by Bento Soares and M.Fatima Bonaldo." 
135 a 159 c 123 g 124 t 1 others 



Query Match 42.4%; Score 486.4; DB 159; Length 542; 

Best Local Similarity 99.6%; Pred. No. l.le-133; 

Matches 4 98; Conservative 0; Mismatches 1; Indels 1; Gaps 



Ov 


647 


Db 


1 


Ov 


707 


Db 


61 


Ov 


767 


Db 


121 


Qy 


827 


Db 


180 


Qy 


887 


Db 


240 


Qy 


947 


Db 


300 


Qy 


1007 


Db 


360 


Qy 


1067 


Db 


420 


Qy 


1127 


Db 


480 



II 



III 



I J 1 1 1 1 1 1 



tttgcaaaggttctctgactcgaacacactgggaaaagtttgtctctgtgacttgtgact 

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII1IIII 

TTTGCAAAGGTTCTCTGACTCGAACACACTGGGAAAAGTTTGTCTCTGTGACTTGTGACT 

tctttgaatcacaggctccagccactggaagtgaaaactctgctgttaaccagaaaccta 

lllllllllllllllllllll lllllllllillllllllllllllilllllllllllll 
TCTTTGAATCACAGGCTCCAG - CACTGGAAGTGAAAACTCTGCTGTTAACCAGAAACCTA 

caaaccttcccaaggtggaagaatcccagcagaaaaacacccccccaacagactccccct 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
C AAACCTT C C C AAGGTGGAAG AATCC C AG C AG AAAAAC ACC C C C C C AAC AG ACT C C C C CT 

ccaaagctgggccaagaggatctgtccaatatcttcctgacttggatgataaaaattccc 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
C C AAAGCTGGG C C AAG AGG AT CTGT C C AAT ATC TT C C TG ACTTGG ATG AT AAAAATTC C C 



gagaaaccctggatatttccttcctcttcctggagcctatggaggagaagctggttgtcc 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

GAGAAACCCTGGATATTTCCTTCCTCTTCCTGGAGCCTATGGAGGAGAAGCTGGTTGTCC 



766 



120 



826 



179 



886 



239 



946 



299 



1006 



aggaaaagggccctcaggaggcctttcctgtgcatctggacctaaccacgaatccccagg 
I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1: M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AGGAAAAGGGCCCTCAGGAGGCCTTTCCTGTGCATCTGGACCTAACCACGAATCCCCAGG 3 59 



1066 



419 



tgcctttccccaaagaaaaagcacgcactgctgagtgcccagggccagcccagaatgcca 1126 

mmilllll M l 1 1 M I 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1: - M M 1 1 1 1 1 1 ; Ml I ! 

TGCCTTTCCCCAGAGAAAAAGCACGCACTGCTGAGTGCCCAGGGCCAGCCCAGAATGCCA 479 



II 



SEQ ID NO: 37 



RESULT 10 

G27766/C 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 



G27766 503 bp DNA STS 29-JUN-1996 

human STS SHGC-33193, sequence tagged site. 

G27766 

G27766.1 GI:1396485 

STS; STS sequence; primer; sequence tagged site, 
human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 503) 



AUTHORS 
JOURNAL 
COMMENT 



Myers, R.M. 
Unpublished (1996) 

Contact: Richard M. Myers 

Stanford Human Genome Center (SHGC) 

Stanford University School of Medicine 

Department of Genetics, M-344, Stanford, CA 94305, USA 
Tel: 4157259687 
Fax: 4157259689 

Email: myers@shgc.stanford.edu 



Primer A: AGTATCTTTATAAGTGACGTGGCTG 
Primer B: GGGCTGCACTCTACCCTGTA 
STS size: 125 
PCR Profile: 

Initial incubation 



94 degrees C for 90 seconds 



Protocol : 



Denaturat ion : 
Annealing: 
Polymerization: 
PCR Cycles : 
Thermal Cycler: 

Template : 
Primer : 
dNTPs : 

Taq Polymerase: 
Total Vol: 



94 degrees C for 15 
62 degrees C for 23 
72 degrees C for 30 
30 

Perkin Elmer 9600 

25 ng 
each 1 uM 
each 2 00 uM 
0.05 units/ul 
10 ul 



seconds 
seconds 
seconds 



Buffer: 



MgCl2 : 
KC1: 

Tris-HCl : 
pH: 



2.5 mM 

50 mM 

20 mM 
8.3 



Prepared with primer pairs provided by Sandoz, derived from T74769 
-- Washington University/Merck EST sequence. 
FEATURES Location/Qualifiers 
source 1. .503 

sapiens" 
9606" 



STS 

primer_bind 
primer_bind 

BASE COUNT 

ORIGIN 



/organism= " Homo 
/db_xref = 11 taxon : 
/map="3" 
13. .137 
13. .37 

complement (118 . , 
118 a 119 c 145 



137) 
9 



112 t 



9 others 



Query Match 19.9%; 
Best Local Similarity 84.6%; 

427 ; Conservative 



Matches 


Qy 


1011 


Db 


503 


Qy 


1069 


Db 


443 


Qy 


1124 


Db 


383 


Qy 


1177 


Db 


323 



Score 298.4; DB 54; 
Pred. No. 4e-79; 
0; Mismatches 60; 



Length 503; 



Indels 



18; 



Gaps 



n I I 



M 



III M M ill; i i in 1 1 ii I I I 



1 1 li III 1 1 III 1 1 1 III 1 1 III M I! 1 1 1 1 



:ttcccc aaagaaaaag 1123 

ii hi i i in 



icgcactgctga gtgcccagggccagcccagaatgccagccctcttgt cctt 1176 

iiiiiiiiin I ii ii I iiiiiiiiiii ii ii i ii 



1 1 Mill 1 1 III 1 1 1 III ill 1 1 i ir: i ill 1 1 ii : in 1 1 mi i ; in 



Qy 



1237 gcgatggggacgatggacagagacagagcgtgcacacgt agagt ggct agtgaaggacgc 1296 

MM I MM 1 1 MM 1 1 III 1 1 MM ,MM I 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 264 GCGATGGGGACGATGGACAGAGACAGAGCGTGCANACGTAGAGTGGCTAGTGAAGGACGC 205 
Qy 1297 ctttttgactcttcttggtctcagcatgttgactgggattggaaataatgagactgagcc 1356 

I 1 1 1 1 1 1 1 II !; , M 1 1 1 1 1 1 1 1 1 1 1 M: i i I 1 1 1 1 1 1 1 M , i I 1 1 1 1 1 1 1 !; I ! I 

Db 204 CTTTTTGACTCTTCTTGGTCTCAGCATGTTGACTGGGATTGGAAATAATGAGACTGA - NC 146 
Qy 1357 ctcggcttgggctgcactctaccctgtacactgccttgtaccctgagctgcatcacctcc 1416 

1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1: i lllllllllllllllllllllllllllllll 

Db 145 CTCGGCTTGGGCTGCACTCTACCCTGTANACTGCCTTGTACCCTGAGCTGCATCACCTCC 86 
Qy 1417 taaactgagcagtc-tcataccatggagagatgcctctcttatgtcttcagccac-tcac 1474 

iiimiiiiiiii ; i 1 1 1 1 1 1 1 : . 1 1 1 1 1 1 1 1 :,.,!! 1 1 1 1 1 1 mm 

Db 85 TAAACTGAGCAGTCTTCATACCATGGAGAGATGCCTCTCTTATGTCTTCAGCCACGTCAC 26 

Qy 1475 ttataaagatacttatcttttcagc 1499 

llllllllllllll lllllllll 
Db 25 TT AT AAAG AT ACTTCNCTTTT C AG C 1 



09445258Results 



RESULT 5 
A2HS_HUMAN 

ID A2HS_HUMAN STANDARD; PRT; 36 7 AA. 

AC P02765; 014962; 014961; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 01-APR-1988 (Rel. 07, Last sequence update) 

DT 01-OCT-2000 {Rel. 40, Last annotation update) 

DE ALPHA- 2 -HS -GLYCOPROTEIN PRECURSOR (FETUIN-A) (ALPHA-2 -Z-GLOBULIN) 

DE (BA -ALPHA- 2 -GLYCOPROTEIN) . 

GN AHSG OR FETUA. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chorda t a; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=87260816; PubMed=34 746 08 ; 

RA Lee C.-C, Bowman B.H., Yang F. ; 

RT "Human alpha 2-HS-glycoprotein: the A and B chains with a connecting 

RT sequence are encoded by a single mRNA transcript."; 

RL Proc. Natl. Acad. Sci. U.S.A. 84:4403-4407(1987). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97464058; PubMed=9322749 ; 

RA Osawa M. , Umetsu K. , Sato M . , Ohki T., Yukawa N., Suzuki T., 

RA Takeichi S . ; 

RT "Structure of the gene encoding human alpha 2-HS glycoprotein 

RT (AHSG) 

RL Gene 196:121-125(1997). 

RN [3] 

RP SEQUENCE OF 19-300. 

RX MEDLINE=86111834; PubMed=3944104 ; 

RA Yoshioka Y., Gejyo F . , Marti T., Rickli E.E., Burgi W., Offner G.D., 

RA Troxler R.F., Schmid K. ; 

RT "The complete amino acid sequence of the A- chain of human plasma 

RT alpha 2HS-glycoprotein . " ; 

RL J. Biol. Chem. 261:1665-1676(1986). 

RN [4] 

RP SEQUENCE OF 341-367. 

RX MEDLINE=83161108; PubMed=6833285 ; 

RA Gejyo F. , Chang J.-L., Burgi W., Schmid K. , Offner G.D., Troxler R.F 

RA van Halbeek H., Dorland L . , Gerwig G.J., Vliegenthart F.G.; 

RT "Characterization of the B-chain of human plasma alpha 

RT 2HS-glycoprotein. The complete amino acid sequence and primary 

RT structure of its heteroglycan. " ; 

RL J. Biol. Chem. 258:4966-4971(1983). 

RN [5] 

RP DISULFIDE BONDS. 

RX MEDLINE=89150282; PubMed=2645 941 ; 

RA Araki T. , Yoshioka Y., Schmid K. ; 

RT "The position of the disulfide bonds in human plasma alpha 2 HS- 

RT glycoprotein and the repeating double disulfide bonds in the domain 

RT structure . " ; 

RL Biochim. Biophys. Acta 994:195-199(1989). 

RN [6] 

RP DISULFIDE BONDS. 

RX MEDLINE=89340518; PubMed=2760061 ; 

RA Kellerman J., Haupt H., Auerswald E.-A., Mueller-Esterl W.; 

RT "The arrangement of disulfide loops in human alpha 2-HS glycoprotein 

RT Similarity to the disulfide bridge structures of cystatins and 

RT kininogens . " ; 

RL J. Biol. Chem. 264:14121-14128(1989). 

RN [7] 

RP SEQUENCE OF 34-367 FROM N.A., AND VARIANT AHSG* 2 . 

RC TISSUE=Liver; 

RX MEDLINE=97157105; PubMed=9003486 ; 

RA Osawa M., Umetsu K., Ohki T., Nagasawa T., Suzuki T., Takeichi S.; 

RT "Molecular evidence for human alpha 2-HS glycoprotein (AHSG) 

RT polymorphism. 11 ; 



1/ 



RL 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 



Hum. Genet. 99:18-21(1997). 

-!- FUNCTION: PROMOTES ENDOCYTOSIS, POSSESSES OPSONIC PROPERTIES AND 
INFLUENCES THE MINERAL PHASE OF BONE. SHOWS AFFINITY FOR CALCIUM 
AND BARIUM IONS. 

-!- SUBUNIT: ALPHA- 2 -HS GLYCOPROTEIN DERIVES FROM THIS PRECURSOR, 

WHEN THE CONNECTING PEPTIDE IS CLEAVED OF. THE TWO CHAINS A AND 
B ARE HELD TOGETHER BY A SINGLE DISULFIDE BOND. 

- ! - SUBCELLULAR LOCATION : SECRETED . 

-!- TISSUE SPECIFICITY: SYNTHESIZED IN LIVER AND SELECTIVELY 

CONCENTRATED IN BONE MATRIX. IT IS ALSO FOUND IN DENTIN IN MUCH 
HIGHER QUANTITIES THAN OTHER PLASMA PROTEINS. 

-!■- POLYMORPHISM: THERE ARE TWO COMMON ALLELES, AHSG*1 AND AHSG*2 . 
AHSG*1 HAS THR-248 AND THR-256; AHSG*2 HAS MET-248 AND SER-256 . 

-!- SIMILARITY: BELONGS TO THE FETUIN FAMILY. 

-!- SIMILARITY: CONTAINS 2 CYSTATIN- LIKE DOMAINS. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



EMBL; Ml 6 961; 
EMBL; D67013; 
EMBL; D67012; 
PIR; A29081; 
PIR; S04467; 
PIR; S02765; 
GlycoSuiteDB; 



AAA51683 , 
BAA22652 , 
BAA22651. 

WOHU. 

S04467 . 

S02765. 
P02765; 



CarbBank; CCSD: 11752; 
CarbBank; CCSD: 36455; -. 
SWISS -2DPAGE; P02765; HUMAN. 
MIM; 138680; -. 
InterPro; IPR000010; -. 
InterPro; IPR001363; -. 
Pfam; PF00031; cyst at in; 2. 
PROSITE; PS01254; FETUIN_1 ; 1. 
PROSITE; PS01255; FETUIN_2 ; 1. 

Glycoprotein; Signal; Repeat; Plasma; Mineral balance; 



Polymorphism . 



FT 


SIGNAL 


1 


18 




FT 


CHAIN 


19 


300 


ALPHA- 2 -HS -GLYCOPROTEIN, CHAIN 


FT 


PEPTIDE 


301 


340 


CONNECTING PEPTIDE. 


FT 


CHAIN 


341 


367 


ALPHA- 2 -HS -GLYCOPROTEIN, CHAIN 


FT 


DOMAIN 


27 


144 


CYSTATIN- LIKE 1. 


FT 


DOMAIN 


145 


260 


CYSTATIN- LIKE 2. 


FT 


DISULFID 


32 


358 


INTERCHAIN. 


FT 


DISULFID 


89 


100 




FT 


DISULFID 


114 


132 




FT 


DISULFID 


146 


149 




FT 


DISULFID 


208 


219 




FT 


DISULFID 


230 


247 




FT 


CARBOHYD 


156 


156 


N- LINKED (GLCNAC. . .). 


FT 


CARBOHYD 


176 


176 


N- LINKED (GLCNAC. . . ). 


FT 


CARBOHYD 


256 


256 


0- LINKED (GALNAC. . . ) . 


FT 


CARBOHYD 


270 


270 


0- LINKED (GALNAC. . .). 


FT 


CARBOHYD 


346 


346 


O-LINKED (GALNAC. . .). 


FT 


VARIANT 


248 


248 


T -> M (IN AHSG*2) . 


FT 








/FTId=VAR_002388 . 


FT 


VARIANT 


256 


256 


T -> S (IN AHSG*2) . 


FT 








/FTId=VAR_002389. 


FT 


CONFLICT 


16 


16 


C -> W (IN REF. 2) . 


FT 


CONFLICT 


54 


54 


W -> K (IN REF. 3) . 


FT 


CONFLICT 


125 


125 


F -> S (IN REF. 7) . 


FT 


CONFLICT 


204 


204 


S -> C (IN REF. 2) . 


SQ 


SEQUENCE 


367 AA; 


39324 


MW; 1AAF0C8D6B7E2789 CRC64; 



Query Match 13.6%; Score 274; DB 1; Length 367; 

Best Local Similarity 26.2%; Pred. No. 4.2e-13; 



Matches 102; Conservative 64; Mismatches 169; Indels r 54; Gaps 17; 



Qy 11 ILVLCCGAMSPPQLALNPSALLSR- -GCNDSDVLAVAGFALRDINKDRKDGYVLRLNRVN 68 

:|:|| : | : |: | |:| : | |: ||:: || ||::: 

Db 5 VLLLCLAQLWGCHSAPHGPGLIYRQPNCDDPETEEAALVAIDYINQNLPWGYKHTLNQID 64 

Qy 69 DAQEYRRGGLGSLFYLTLDVLETDCHVLRKKAWQDCGMRIFFE-SVYGQCK-AIFYMNNP 126 

: : : : I II : :| III I I I I I =1 I • I I I = =: 

Db 65 EVKWPQQPSGELFEIEIDTLETTCHVLDPTPVARCSVRQLKEHAVEGDCDFQLLKLDGK 124 

Qy 127 S R VL YLAAYNCT LR P VS KKK I YMT CPDCPSSIPTDSS NHQ VL E AAT E S LAKYNN ENT S KQ 186 

hi I I I : : Mil I : : : : II :|| :| :| 

Db 125 FSWYA KCDSS PD S AE D VRKVCQD C P LLA P LND T - - R WHAAKAALAA FNAQNNG S N 179 

Qy 187 YSLFKVTRASSQWV-VGPSYFVEYLIKESPCT KSQASSCSLQSSDSVPVGLCKGSLT 242 

: | :::|| | | : || :||: : : | ::|: |:| : | || :|: 

Db 180 FQLEEISRA- -QLVPLPPSTYVEFTVSGTDCVAKEATEAAKCNLLAEKQ- - YGFCKATLS 235 

Qy 243 RTHWEKFVSVTCDFFESQAPATGSENSAVNQK-PTNLPKVEESQQKNTPPTDSPSKAGPR 301 

hill 1-1 : = h || I : HI • I I 

Db 236 EKLGGAEVAVTCTVFQTQPVTSQPQPEGANEAVPT PWDPDAPPSPPLGAPG-LPPA 2 91 

Qy 302 GSVQYLPDLDDKNSQEKGPQEAFPVHLDL TTNPQGETLDISFLFLEPMEEK 352 

II II : I I III H II 
Db 292 GSP PD S H VL LAA P PGHQ - LHRAH YD LRHT FMG WS LGS P S G E 332 

Qy 353 LWLP F P KE KART AE C PG P AQNAS P L VL P 381 

= h III I I hi I 

Db 333 ---VSHPR-KTRTWQPSVGAAAGPWPP 357 

RESULT 6 
A2HS_RABIT 

ID A2HS_RABIT STANDARD; PRT; 360 AA. 

AC P80191; 018997; 

DT 01-JUL-1993 {Rel. 26, Created) 

DT 01-OCT-2000 (Rel. 40, Last sequence update) 

DT 01-OCT-2000 {Rel. 40, Last annotation update) 

DE ALPHA- 2 -HS -GLYCOPROTEIN PRECURSOR (FETUIN-A) (HAEMONECTIN) (FRAGMENT) . 

GN AHSG OR FETUA. 

OS Oryctolagus cuniculus (Rabbit) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCBI_TaxID=9986 ; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC STRAIN=NEW ZEALAND WHITE; TISSUE^Liver ; 

RA Osawa M., Saito T. , Takeichi S.; 

RT "Nucleotide sequence of cDNA encoding rabbit fetuin."; 

RL Submitted (SEP-1995) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 19-26; 55-62; 118-128; 235-250 AND 332-344. 

RC STRAIN=NEW ZEALAND WHITE; TISSUE=Bone marrow; 

RX MEDLINE=93238730; PubMed=7682944 ; 

RA White H., Totty N. , Panayotou G. ; 

RT "Haemonectin, a granulocytic-cell -binding protein, is related to the 

RT plasma glycoprotein fetuin."; 

RL Eur. J. Biochem. 213:523-528(1993). 

CC -!- FUNCTION: A CELL ADHESION PROTEIN THAT BINDS IMMATURE CELLS OF 

CC THE GRANULOCYTE LINEAGE. 

CC -!- SUBCELLULAR LOCATION: SECRETED. 

CC -!- TISSUE SPECIFICITY: BONE MARROW. 

CC -!- SIMILARITY: BELONGS TO THE FETUIN FAMILY. 

CC -!- SIMILARITY: CONTAINS 2 CYSTATIN- LIKE DOMAINS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 



CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 



CC 
CC 
DR 


or send an email to license@isb-sib . ch) . 




EMBL; D67014; BAA22653.1; 








DR 


PIR; S30340; S30340. 








DR 


InterPro; 


IPR000010; - . 








DR 


InterPro; 


IPR001363; - . 








DR 


Pfam; PF00031; cystatin; 


2 . 






DR 


PROSITE; 


PS01254; 


FETUIN_ 


l; 


1. 




DR 


PROSITE; 


PS01255; 


FETUIN 


2; 


1. 




KW 


Cell adhesion; Repeat; Glycoprotein; Signal. 




FT 


NON_TER 


1 


1 








FT 


SIGNAL 


<1 


15 




BY SIMILARITY. 




FT 


CHAIN 


16 


360 




ALPHA- 2 -HS -GLYCOPROTEIN . 


FT 


DOMAIN 


24 


141 




CYSTATIN- LIKE 1. 




FT 


DOMAIN 


142 


257 




CYSTATIN- LIKE 2 . 




FT 


DISULFID 


29 


351 




BY SIMILARITY. 




FT 


DISULFID 


86 


97 




BY SIMILARITY. 




FT 


DISULFID 


111 


129 




BY SIMILARITY. 




FT 


DISULFID 


143 


146 




BY SIMILARITY. 




FT 


DISULFID 


205 


216 




BY SIMILARITY. 




FT 


DISULFID 


227 


244 




BY SIMILARITY. 




FT 


CARBOHYD 


96 


96 




N- LINKED (GLCNAC. 


. .) (POTENTIAL) 


FT 


CARBOHYD 


153 


153 




N- LINKED (GLCNAC. 


. .) (POTENTIAL) 


FT 


CONFLICT 


126 


126 




S -> P (IN REF . 1) 




SQ 


SEQUENCE 


360 AA; 


38386 


MW; F0908740610CBB95 


CRC64; 



Query Match 13.4%; Score 270.5; DB 1; Length 360; 

Best Local Similarity 25.0%; Pred. No. 7.4e-13; 

Matches 99; Conservative 51; Mismatches 181; Indels 65; Gaps 13; 

Qy 3 LLLPLALCILVLCCGAMSPPQLALNPSALLSRGCNDSDVLAVAGFALRDINKDRKDGYVL 62 

hi 1 = 1 I I : I I hi : II h II : II 

Db 1 LVLLLSLAQLWSCHLVTAVPLLGYR EHNCDDPEAEQVALLAVDHINNHLQQGYKH 55 

Qy 63 RLNRVNDAQEYRRGGLGSLFYLTLDVLETDCHVLRKKAWQDCGMR- IFFESVYGQCKAIF 121 

llh: i I I :| III II I : I : I : : I I I I 

Db 56 ILNRIDKVKVWPRRPTGEVYELEIDTLETTCHALDPTPLANCSVRQVTQHAVEGDCD- - F 113 

Qy 122 YMNNPSRVLYLAAYNCTLRPVSKKKIYMTCPDCPSSIPTDSSNHQVLEAATESLAKYNNE 181 

: : | | | : | ||||| | : : :| :j| :| :| : 
Db 114 HVLKQDGQFTVLSAKCDSTPDSAEDILKLCPDCPLLTPLNDT- - RVAQAAEAALTAFNEK 171 

Qy 182 NTSKQYSLFKVTRASSQWWGP-SYFVEYLIKESPCTKSQA SSCSLQSSDSVPVGLC 237 

I I :: II I I I I Hh : : I = = hi : II 

Db 172 NNGAYLQLVEIARA- -QLVPLPASTYVEFTVAATDCVAKEVTDPAKCNLLADKQ- -YGFC 22 7 

Qy 238 KGSLTRTHWEKFVSVTCDFFESQ APATGSENSAVNQKPTNLPKVEESQQKNTPPTD 293 

I - : I III I :| I I =1 : | |: : : | 

Db 22 8 KATVAEKVAREEVEVTCTIFPAQPWPQPQPGVAGAAAVEP APAVDPASPVSPPDGQ 2 84 

Qy 294 SPSK AGPRGSVQYLPDLDDKNSQEKGPQEAFPVHLDLTTNPQGETLDI 341 

III III II I III 
Db 285 SPSSLWGPVLVAQAPAPPRAHYDLRQTFAGVPSMESGSGEAF 32 7 

Qy 342 SFLFLEPMEEKLWLPFPKEKARTAECPGPAQNASP 377 

I : : I I I III I 

Db 328 HPGKVPWVQP SVGAAPGPVITPCP 352 



RESULT 1 
US-08-737-045-14 

; Sequence 14, Application US/08737045A 

; Patent No. 5981483 

; GENERAL INFORMATION: 

; APPLICANT: Dennis, James W. 

; APPLICANT: Denetriou, Michael 

APPLICANT: Mount Sinai Hospital Corporation 



; TITLE OF INVENTION: COMPOSITIONS COMPRISING MODULATORS OF CYTOKINES OF THE 
; TITLE OF INVENTION: TGFb SUPERFAMILY AND A METHOD OF TREATMENT WITH SUCH A 
; TITLE OF INVENTION: COMPOSITION (AS AMENDED) 
; FILE REFERENCE: 7933 . 94USWO 

; CURRENT APPLICATION NUMBER: US/08/737 , 045A 
; CURRENT FILING DATE: 1997-03-20 
; NUMBER OF SEQ ID NOS : 14 
; SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 14 

LENGTH: 367 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-08-737-045-14 



Query Match 13.6%; Score 274; DB 2; Length 367; 

Best Local Similarity 26.2%; Pred. No. l.le-19; 

Matches 102; Conservative 64; Mismatches 169; Indels 54; Gaps 17; 

Qy 11 ILVLCCGAMSPPQLALNPSALLSR- -GCNDSDVLAVAGFALRDINKDRKDGYVLRLNRVN 68 

:|:|| : I = 1=1 1 = 1= 11= ll= = II 11 = : = 

Db 5 VLLLCLAQLWGCHSAPHGPGLIYRQPNCDDPETEEAALVAIDYINQNLPWGYKHTLNQID 64 

Qy 69 DAQEYRRGGLGSLFYLTLDVLETDCHVLRKKAWQDCGMRIFFE-SVYGQCK-AIFYMNNP 126 

= = = = I II = =1 III Mil 1=1 I =1 I I 

Db 65 EVKVWPQQPSGELFEIEIDTLETTCHVLDPTPVARCSVRQLKEHAVEGDCDFQLLKLDGK 124 

Qy 12 7 SRVLYLAAYNCTLRPVSKKKIYMTCPDCPSSIPTDSSNHQVLEAATESLAKYNNENTSKQ 186 

1 = 1 I II : : I III I = = =1 = II =11 =1 =1 

Db 125 FSWYA KCDSS PDSAEDVRKVCQDCPLLAPLNDT - - RWHAAKAALAAFNAQNNGSN 179 

Qy 187 YSLFKVTRASSQWV-VGPSYFVEYLIKESPCT KSQASSCSLQSSDSVPVGLCKGSLT 242 

: | :::|| | | : || :||: : : | ::|: |:| : | || :|: 

Db 180 FQLEEISRA- -QLVPLPPSTYVEFTVSGTDCVAKEATEAAKCNLLAEKQ- - YGFCKATLS 235 

Qy 243 RTHWEKFVSVTCDFFESQAPATGSENSAVNQK-PTNLPKVEESQQKNTPPTDSPSKAGPR 3 01 

1 = 111 l = = l = = 1=11 I :|| I | 

Db 236 EKLGGAEVAVTCTVFQTQPVTSQPQPEGANEAVPT PWDPDAPPSPPLGAPG-LPPA 291 

Qy 302 GSVQYLPDLDDKNSQEKGPQEAFPVHLDL TTNPQGETLD I S FLFLE PMEEK 352 

II II =11 III =1 II 
Db 292 GSP PDSHVLLAAPPGHQ-LHRAHYDLRHTFMGWSLGSPSGE 332 

Qy 353 LWLPFPKEKARTAECPGPAQNASPLVLP 381 

: 1=111 I I hi I 

Db 333 VSHPR-KTRTWQPSVGAAAGPWPP 357 



RESULT 4 
US-08-483-926A-10 

; Sequence 10, Application US/08483926A 
; Patent No. 5821227 
; GENERAL INFORMATION: 

APPLICANT: Dennis, James W. 

TITLE OF INVENTION: MODULATORS OF CYTOKINES OF THE TGF BETA 
TITLE OF INVENTION: SUPERFAMILY AND METHODS FOR ASSAYING FOR SAME 
NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: BERESKIN & PARR 

STREET: 4 0 king Street West 

CITY: Toronto 

STATE: Ontario 

COUNTRY : Canada 

ZIP: M5H 3Y2 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 



APPLICATION NUMBER: US/08/483 , 926A 

FILING DATE: 07-JUN-1995 

CLASSIFICATION: 514 
ATTORNEY/AGENT INFORMATION: 

NAME: Kurdydyk, Linda M. 

REGISTRATION NUMBER: 34,971 

REFERENCE/DOCKET NUMBER: 3153-155 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (416) 3 64-7311 

TELEFAX: (416) 361-1398 
INFORMATION FOR SEQ ID NO : 10: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 364 amino acids 

TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
ORIGINAL SOURCE: 

ORGANISM: Sheep 
US-08-483-926A-10 



Query Match 13.2%; Score 265.5; DB 2; Length 364; 

Best Local Similarity 29.5%; Pred. No. 7.7e-19; 

92; Conservative 49; Mismatches 136; Indels 35; Gaps 

VLCILVLCCGAMSPPQLALNPSA-LLSRGCNDSDVLAVAGFALRDINKDRKDGYVLRLN 65 

h II I : hi I hi I I h III II II 

jFCLAQLC SCRSIPLDPIAGYKEPACDDPDTEQAALAAVDYINKHLPRGYKHTLN 61 

/NDAQEYRRGGLGSLFYLTLDVLETDCHVLRKKAWQDCGMRI FFE - SVYGQCKA- I FYM 123 

: = I | : : : : | ||| |||| :| :| | :| | | : 



Ih I ' Mill I H Ih II HI =1 =1 

JFSVLFT- - -KCDSSPDSAEDVRKLCPDCPLLAPL- - NN S Q WHAAE VALAT FN AQNN 176 

JYSLFKVTRASSQWWGP-SYFVEYLIKESPCTKSQA SSCSLQSSDSVPVGLCKG 239 

= I -HI hi I I Ih : : I : = hi = I III 

fFQLVEISRA- -QFVPLPGSVSVEFAVAATDCIAKEWDPTKCNLLAEKQ- -YGFCKG 232 

240 S - LTRTHWEKFVS VTCDFFES QAPATGSENSAVNQKPTNLPKVEESQQKNTPPTD 293 

: |:||| |:: | |:| | |: :| 
IQKALGGEDVTVTCTLFQTQPVIPQPQPEGAEAGA PSAVPDAAVPD AAVP 2 84 



Matches 


Qy 


7 


Db 


6 


QY 


, 66 


Db 


62 


Qy 


124 


Db 


122 


Qy 


184 


Db 


177 


Qy 


240 


Db 


233 


Qy 


294 


Db 


285 



II II I III 



RESULT 15 
US-08-193-114B-1 

; Sequence 1, Application US/08193114B 
; Patent No. 5472945 
; GENERAL INFORMATION: 

APPLICANT: Schmaier, Alvin H. 
; APPLICANT: Jiang, Yongping 

TITLE OF INVENTION: Modulation of Blood 

TITLE OF INVENTION: Pressure and Inhibition of Platelet Activation 
TITLE OF INVENTION: with Kininogen Fragment 
NUMBER OF SEQUENCES: 2 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Seidel, Gonda, Lavorgna & 

ADDRESSEE: Monaco, P.C. 
; STREET: 1800 Two Perm Center Plaza 

CITY: Philadelphia 

STATE: Pennsylvania 

COUNTRY : U.S.A. 
ZIP : 19102 
COMPUTER READABLE FORM: 



' MEDIUM TYPE: Diskette, 3.50 inch, 720 Kb 
COMPUTER: IBM PS/2 
OPERATING SYSTEM: MS-DOS 
SOFTWARE: WordPerfect 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/193 , 114B 
FILING DATE: 9 February 1994 
CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: U.S. Application 
APPLICATION NUMBER: Serial No. 5472945 07/744,545 
FILING DATE: 13 August 1991 
ATTORNEY/ AGENT INFORMATION : 
NAME: Monaco, Daniel A. 
REGISTRATION NUMBER: 30,480 
REFERENCE/DOCKET NUMBER: 6056-137 CI1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (215) 568-8383 
TELEFAX: (215) 568-5549 
TELEX: No. 5472945e 
INFORMATION FOR SEQ ID NO : 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 117 amino acids 
TYPE: peptide 
TOPOLOGY: linear 
US-08-193-114B-1 



Query Match 6.4%; Score 129.5; DB 1; Length 117; 

Best Local Similarity 30.4%; Pred. No. le-05; 

Matches 35; Conservative 18; Mismatches 43; Indels 19; Gaps 

Qy 151 CPDCPSSIPTD SS NH Q VLE AAT E S LAK YNNENT S KQ Y - S L F KVTRAS S QWWG P S Y F VE Y 209 

I II MM I I II II = I = I =1 I I I II-: 

Db 1 CVGCPRDI PTNSP - - ELEETLTHTITKLNAENNATFYFKIDNVKKARVQWAGKKYFIDF 58 

Qy 210 LIKESPCTKSQ ASSCSL QSSDSVPVGLCKGSLTRTHWEK- -FVSVTC 254 

: M hi II III I = III : = I I 

Db 59 VARETTCSKESNEELTESCETKKLGQSLD CNAEVYWPWEKKIYPTVNC 107 



